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ABSTRACT 

Methodology  developed  by  Blischke  (Ann.  Math,  Statist.  33,  (1962),_444 
54)  is  applied  to  estimate  the  parameters  in  a  model  of  faulty  inspection. 


and  to  obtain  approximate  formulae  for  the  variances  of  these  estimators. 
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INTRODUCTION 

Recent  papers  (Johnson  et  al.  (1980),  Johnson  &  Kotz  (1983),  Kotz  & 
Johnson  ( 1982 )) have  developed  distributions  of  observed  numbers  of 
apparently  defective  items  when  sample  inspection  is  imperfect,  resulting 
in  sane  defectives  not  being  observed  as  such,  and  possibly  some  non¬ 
defective  being  described  as  'defective1  ("false  positives").  Although 
these  results  are  of  interest,  some  more  practical  problems  arise  when  it 
is  desired  to  test  whether  the  inspection  is  faulty  or  to  estimate  the 
degree  of  imperfection.  In  Johnson  and  Kotz  (1985)  sane  tests  for 
detection  of  faulty  inspection  were  investigated.  The  present  paper  is 
devoted  to -the  estimation  aspects  of  the  problem.  We  will  consider  here 
the  simplest  form  of  inspection  by  attributes,  assuming  lot  size  to  be, 
effectively,  infinite.  Each  individual  in  a  randan  sample  of  size  n  is 
examined  and  a  decision  reached  as  to  whether  or  not  it  is  'nonconforming' 
(NC).  Ideally,  of  course,  such  decisions  should  be  canpletely  free  of 
error,  but,  as  is  well-known,  this  is  often  not  the  case.  As  a  model  of 
faulty  inspection,  we  introduce  two  parameters 

p  =  Pr[ individual  declared  NC  (  individual  ^s  NC] 
p'  *  Prlindividual  declared  NC  j  individual  _is  not  NC]. 
and  suppose  we  wish  to  estimate  these  parameters.  The  proportion,  P,  of  NC 
individuals  in  the  lot  is  unknown,  and  plays  the  role,  in  this  context,  of 
a  nuisance  parameter. 
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It  is  clearly  not  possible  to  estimate  p  and  p'  {or  P)  if  each 
individual  is  examined  only  once.  The  only  function  of  the  parameters 
which  can  be  estimated  fran  such  data  is  essentially  Pp  +  (l-P)p'  -  the 
probability  that  an  individual  chosen- at  randan  is  declared  NC  -  because 
the  distribution  depends  only  on  this  quantity. 


2^  ESTIMATION 

If  individuals  are  examined  rrore  than  once,  however,  it  is  possible  to 
estimate  each  of  the  three  parameters.  We  will  suppose  that  each  of  the  n 
individuals  in  the  random  sample  is  examined  on  m  independent  occasions. 

If  denote  the  number  of  times  the  i-th  individual  is  declared  to  be  NC, 
it  has  the  distribution 

_  d.  m-d.  d.  m-d. 

Pr[D.=d.]  =  PC  )  p  1(l-p)  1  +  (l-P)C  )  P'  x(l-p')  1  (1) 

i  i 

(d^  =  0,1,. ..,m). 


This  is  a  mixture  of  two  binomial  distributions,  with  parameters  (m,p)  and 
(m,p')  in  proportions  P,  (1-P)  respectively.  The  r-th  factorial  moment  of 


=  E[Di(D.-l)  ...  (Di-r+l)]  =m<r){Ppr+  (1-P)p’r>  (2) 

Estimating  the  parameters  by  making  sample  and  population  values  of 


the  first  three  factorial  moments  agree,  we  have 


Pp  +  (l-P)p’  =  Fr  (r=l,2,3> 


where  Fr  =  (m(r))-1n-1  ^  Di(r) 

i=l 


Solutions  p,  p*  and  P  of  (3)  are  given  by  Jones  (1933)  as  follows 

(a)  p,  p'  are  roots  (in  0)  of  the  equation 

02  -  A0  +  AF1  -  F2  =  0  (4 

where  A  =  (F^F-jF^/^-F-j2)  (note  that  AF.^  =  (F^-F^/^-F2) ) 

(b)  P=  (F-j-p'J/fp-p')  .  (5) 

There  is  indeterminacy  in  the  solution,  since  if  (p,p',P)  is  a 

solution,  so  is  (p',p,l-P).  We  will  adopt  the  convention  of  regarding  the 
greater  root  of  (4)  as  the  estimator  (p)  of  p.  It  is  reasonable  to  suppose 
that  p  is  greater  than  p'  -  that  is,  the  probability  of  declaring  an 
individual  to  be  NC  if  it  is,  indeed,  NC  is  greater  than  if  it  is  not. 
However,  it  must  be  remembered  that  even  if  this  is  so  (i.e.  p  >  p'),  this 
does  not  ensure  that  p  must  exceed  p ' . 


3.  Illustrative  Example 

For  purposes  of  calculation,  note  that 


/  D’ 

/_  i 

i=l 


D.(r)=)  N.  j(r) 

’  Z_  3  j 

j=l 


where  =  number  of  individuals  declared  NC  on  just  j  occasions  among  the 
m  times  examined. 


Suppose  we  have  n=50,  m=3;  Nn=43,  N. =1,  N_=l,  N-=5.  Then 


F  =  (3(r)  *  50)  ^ (43*  0  +  1*1'"'  +  1-2'"'  +  5-3“')  , 

r 


is  *!-m  (1+2+15)  ? 


F2  “  300  (2+30)  75  ; 


F3  ”  300  *  30  10  ' 


where  A  =  ~r  =  0.94508671  and  p,p'  are  roots  of  02-O. 945199  +  0.00674  =  0. 
346 

We  find  p  =  0.9379;  p'  =  0.0072;  P  =  0.1212. 

In  this  case  p,  p'  and  P  are  all  between  0  and  1. 

If  they  are  not  the  method  fails,  though  Blischke  (1962,  1964)  has 
suggested  rules  for  this  case. 


4.  Variances 

Blischke  (1962)  has  obtained  the  following  asymptotic  formulae  for  the 
variances  of  p,  p'  and  P; 


p(l-p)  2 (4B„+B„) 


6 (B-+B- ) 


Pnm  nrrUm-DC^  mn  (m-1 )  (m-2 ) 


p'(l-p')  2 (B_+4B1) 


6  (Bvt-Bi ) 


( 1-P ) nm  nm (m-1 ) Cl  nm  (m-1 )  (m-2 )  C ! 

2  4 


P(l-P)  18 (B-+BI)  24 (B,+BI ) 

var(P)  -  - +  - - - r  + - - — - - 

n  nm(m-l)  (p-p' r  nm(m-l)  (m-2)  (p-p* ) 
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-where  ^  =  Pph(l-p)h;  =  (l-P)p,h(l-p' )h 
=  P2(p-p*  )h;  =  (1-P)2(^p')h 

When  m  is  large,  the  first  term  of  each  expression  usually  gives  quite  good 
approximation,  so  that  we  may  take 

p(p-l)  p'(l-p')  _  P(l-P) 

var(p)  4=  -  ;  var(p')  *?  -  ;  var(P)  4"  -  .  (8) 

P  mn  (l-P)mn  n 

Note  that  the  denominators  are,  for  v^r(p),  the  expected  number  of 
examinations  of  NC  individuals;  for  var(p'),  the  expected  numbers  of 
examinations  of  conforming  individuals;  and  for  var(P)  the  number  of 
individual's  in  the  sample. 

Using  the  numerical  values  in  the  example  of  Section  3,  and  inserting 
the  values  p,p',P  for  p,p',P  respectively,  we  find  (using  (7.1)-(7.3) 
var(p)  4*  0.004133 
var(p')  4=  0.0000591 
var(P)  =  0.002170 

The  last  two  terms  in  the  expressions  on  the  right-hand  sides  of  (7.1)- 
(7.3)  are 


0.000885 

and 

0.000044 

for 

var  (p)  ; 

0.000004 

and 

0,000001 

for 

var(p')  ; 

0.000036 

and 

0.000003 

for 

var  (P). 

So  the  use  of  (8)  in  this  case,  at  least,  would  give  quite  good  results. 


.even  though  m  is  only  3.  (To  the  same  order  of  approximation  the  three 
estimators  are  uncorrelated. ) 


5.  Confidence  Intervals 

Blischke  (1962)  also  showed  that  the  asyrrptotic  distributions  of  the 
estimators  are  normal.  For  P,  approximate  100(l-ct)%  confidence  regions 
can  be  obtained  frcm  the  inequality 


n ( P-P ) 2 
P(l-P) 


_ .  .  y  2 

where  $(-\^a)  =  i a  and  |)(y)  =  (s/2lf)  1  5.^  e  *u  du  .  Taking  a  = 

2  2 

0.05,  so  that  \^a  =  \0  Q25  =  3.8416  the  approximate  95%  region  for  P  is 
50 (0. 1212-P) 2  <  3.8416  P(l-P) 


or  equivalently 

53.8416  P2  -  15.9616P  +  0.73447  <  0 
that  is  0.057  <  P  <  0.240  . 

Unfortunately  we  cannot  use  this  method  to  obtain  confidence  regions 
for  p  and  p'.  The  corresponding  region  for  p  (using  (8))  would  be 


P  mn(p-p)2 
p(l-p) 


which  cannot  be  used  because  P  is  not  known.  We  might  replace  P  by  P. 
This  would  give  an  asymptotically  correct  region. 

Using  the  value  P  =  0.1212,  and  taking  a  =  0.05,  as  before,  we  get 


the  (approximate)  95%  confidence  regions: 
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■for  p;  0.1212  •  150 (0. 9379-p)2  <  3.8416  p(l-p)  , 

whence  22.0216  p2  -  37.9436p  +  15.9922  <  0 
leading  to  the  interval  0.7350  <  p  <  0.9880  ; 
for  gj  18.18  (0.0072-p’)2  <  3.8416  p'(l-p') 

whence  22.0216  p'2  -  4.1034  p'  +  0.00094  <  0 
leading  to  the  interval  0.00023  <  p'  <  0.0650. 
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